Smart Paper Notes

NVS-MonoDepth: Improving Monocular Depth Prediction with Novel View Synthesis

Zuria Bauer, Zuoyue Li, Sergio Orts-Escolano, Miguel Cazorla, Marc Pollefeys, Martin R. Oswald

Category: Computer Vision

2021-12-22

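Building on recent progress in novel view synthesis, we propose applying it to improve monocular depth estimation. In particular, we propose a novel training method split into three main steps. First, the prediction results of a monocular depth network are warped to an additional viewpoint. Second, we apply an additional image synthesis network, which corrects and improves the quality of the warped RGB image. The output of this network is required to look as similar as possible to the ground-truth view by minimizing the pixel-wise RGB reconstruction error. Third, we reapply the same monocular depth estimation to the synthesized second viewpoint and ensure that the depth predictions are consistent with the associated ground-truth depth. Experimental results show that our method achieves state-of-the-art or comparable performance on the KITTI and NYU-Depth-v2 datasets with a lightweight and simple vanilla U-Net architecture.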

translated by Google Translate

Domain Adaptation of Transformer-Based Models using Unlabeled Data for Relevance and Polarity Classification of German Customer Feedback

Ahmad Idrissi-Yaghir, Henning Schäfer, Nadja Bauer, Christoph M. Friedrich

Categories: Natural Language Processing | Machine Learning

2022-12-12

Understanding customer feedback is becoming a necessity for companies to identify problems and improve their products and services. Text classification and sentiment analysis can play a major role in analyzing this data by using a variety of machine and deep learning approaches. In this work, different transformer-based models are utilized to explore how efficient these models are when working with a German customer feedback dataset. In addition, these pre-trained models are further analyzed to determine if adapting them to a specific domain using unlabeled data can yield better results than off-the-shelf pre-trained models. To evaluate the models, two downstream tasks from the GermEval 2017 are considered. The experimental results show that transformer-based models can reach significant improvements compared to a fastText baseline and outperform the published scores and previous models. For the subtask Relevance Classification, the best models achieve a micro-averaged $F1$-Score of 96.1 % on the first test set and 95.9 % on the second one, and a score of 85.1 % and 85.3 % for the subtask Polarity Classification.
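
As a reading aid for the micro-averaged F1 score reported above, here is a minimal sketch of how that metric is computed: true positives, false positives, and false negatives are pooled over all classes before precision and recall are taken. The function name `micro_f1` and the toy labels are illustrative assumptions, not data or code from GermEval 2017 or from the paper.

```python
def micro_f1(y_true, y_pred):
    """Micro-averaged F1: pool TP/FP/FN over all classes, then compute F1."""
    tp = fp = fn = 0
    for label in set(y_true) | set(y_pred):
        for t, p in zip(y_true, y_pred):
            if p == label and t == label:
                tp += 1          # predicted this class, and it was correct
            elif p == label and t != label:
                fp += 1          # predicted this class, but it was another one
            elif p != label and t == label:
                fn += 1          # missed an instance of this class
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy polarity labels (made up for illustration): 4 of 5 predictions are correct.
gold = ["pos", "neg", "neu", "neu", "neg"]
pred = ["pos", "neu", "neu", "neu", "neg"]
score = micro_f1(gold, pred)
```

In a single-label setting such as this toy example, every misclassification counts once as a false positive and once as a false negative, so the micro-averaged F1 coincides with plain accuracy.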

Topologically faithful image segmentation via induced matching of persistence barcodes

Nico Stucki, Johannes C. Paetzold, Suprosanna Shit, Bjoern Menze, Ulrich Bauer

Categories: Computer Vision | Machine Learning

2022-11-28

Image segmentation is a largely researched field where neural networks find vast applications in many facets of technology. Some of the most popular approaches to train segmentation networks employ loss functions optimizing pixel-overlap, an objective that is insufficient for many segmentation tasks. In recent years, their limitations fueled a growing interest in topology-aware methods, which aim to recover the correct topology of the segmented structures. However, so far, none of the existing approaches achieve a spatially correct matching between the topological features of ground truth and prediction. In this work, we propose the first topologically and feature-wise accurate metric and loss function for supervised image segmentation, which we term Betti matching. We show how induced matchings guarantee the spatially correct matching between barcodes in a segmentation setting. Furthermore, we propose an efficient algorithm to compute the Betti matching of images. We show that the Betti matching error is an interpretable metric to evaluate the topological correctness of segmentations, which is more sensitive than the well-established Betti number error. Moreover, the differentiability of the Betti matching loss enables its use as a loss function. It improves the topological performance of segmentation networks across six diverse datasets while preserving the volumetric performance. Our code is available in https://github.com/nstucki/Betti-matching.
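
To make the comparison with the Betti number error concrete, the following minimal sketch computes that baseline metric in dimension 0 (the number of connected components) for small binary masks. It is not the Betti matching algorithm from the paper; the helper names, the 4-connectivity choice, and the toy masks are assumptions made for illustration.

```python
def count_components(mask):
    """Count 4-connected foreground components (Betti number b0) by flood fill."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] and not seen[r][c]:
                count += 1
                seen[r][c] = True
                stack = [(r, c)]
                while stack:
                    y, x = stack.pop()
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if (0 <= ny < rows and 0 <= nx < cols
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            stack.append((ny, nx))
    return count


def betti0_error(pred, gt):
    """Betti number error in dimension 0: absolute difference of component counts."""
    return abs(count_components(pred) - count_components(gt))


gt = [[1, 1, 0, 0],
      [0, 0, 0, 0],
      [0, 0, 1, 1]]
pred_broken = [[1, 0, 1, 0],    # one extra, spurious component
               [0, 0, 0, 0],
               [0, 0, 1, 1]]
pred_shifted = [[0, 0, 1, 1],   # same count, but components in the wrong places
                [0, 0, 0, 0],
                [1, 1, 0, 0]]
```

Here `betti0_error(pred_shifted, gt)` is zero even though no predicted component overlaps a ground-truth one. Because the metric only compares counts, it cannot see this kind of spatial mismatch, which is the gap a spatially correct matching of features is meant to close.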

A Generative Approach for Production-Aware Industrial Network Traffic Modeling

Alessandro Lieto, Qi Liao, Christian Bauer

Category: Machine Learning

2022-11-11

The new wave of digitization induced by Industry 4.0 calls for ubiquitous and reliable connectivity to perform and automate industrial operations. 5G networks can afford the extreme requirements of heterogeneous vertical applications, but the lack of real data and realistic traffic statistics poses many challenges for the optimization and configuration of the network for industrial environments. In this paper, we investigate the network traffic data generated from a laser cutting machine deployed in a Trumpf factory in Germany. We analyze the traffic statistics, capture the dependencies between the internal states of the machine, and model the network traffic as a production state dependent stochastic process. The two-step model is proposed as follows: first, we model the production process as a multi-state semi-Markov process, then we learn the conditional distributions of the production state dependent packet interarrival time and packet size with generative models. We compare the performance of various generative models including variational autoencoder (VAE), conditional variational autoencoder (CVAE), and generative adversarial network (GAN). The numerical results show a good approximation of the traffic arrival statistics depending on the production state. Among all generative models, CVAE provides in general the best performance in terms of the smallest Kullback-Leibler divergence.
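
The two-step structure described above can be sketched roughly as follows: a semi-Markov process chooses the production state and how long it lasts, and packet interarrival times are then drawn from a distribution conditioned on that state. Everything concrete here is a hypothetical placeholder: the two states, the numeric parameters, the uniform sojourn times, and the exponential interarrival times, which merely stand in for the learned generative model (VAE, CVAE, or GAN).

```python
import random

# Hypothetical production states and parameters (not taken from the paper).
TRANSITIONS = {"idle": {"cutting": 1.0}, "cutting": {"idle": 1.0}}
MEAN_SOJOURN = {"idle": 5.0, "cutting": 20.0}        # seconds spent in a state
MEAN_INTERARRIVAL = {"idle": 1.0, "cutting": 0.05}   # seconds between packets


def simulate(duration, seed=0):
    """Return a list of (timestamp, state) packet arrivals up to `duration`."""
    rng = random.Random(seed)
    t, state = 0.0, "idle"
    packets = []
    while t < duration:
        # Step 1 (semi-Markov): sojourn time from a state-specific,
        # non-exponential distribution.
        sojourn = rng.uniform(0.5, 1.5) * MEAN_SOJOURN[state]
        end = min(t + sojourn, duration)
        # Step 2: state-conditional interarrival times (exponential placeholder
        # for a learned conditional generative model).
        arrival = t + rng.expovariate(1.0 / MEAN_INTERARRIVAL[state])
        while arrival < end:
            packets.append((arrival, state))
            arrival += rng.expovariate(1.0 / MEAN_INTERARRIVAL[state])
        t = end
        next_states = TRANSITIONS[state]
        state = rng.choices(list(next_states), weights=list(next_states.values()))[0]
    return packets


packets = simulate(200.0)
```

Swapping the exponential placeholder for samples from a trained conditional model would keep the same control flow; only the line that draws the interarrival time would change.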

FED-CD: Federated Causal Discovery from Interventional and Observational Data

Amin Abyaneh, Nino Scherrer, Patrick Schwab, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou

Category: Machine Learning

2022-11-07

Causal discovery, the inference of causal relations from data, is a core task of fundamental importance in all scientific domains, and several new machine learning methods for addressing the causal discovery problem have been proposed recently. However, existing machine learning methods for causal discovery typically require that the data used for inference is pooled and available in a centralized location. In many domains of high practical importance, such as in healthcare, data is only available at local data-generating entities (e.g. hospitals in the healthcare context), and cannot be shared across entities due to, among others, privacy and regulatory reasons. In this work, we address the problem of inferring causal structure - in the form of a directed acyclic graph (DAG) - from a distributed data set that contains both observational and interventional data in a privacy-preserving manner by exchanging updates instead of samples. To this end, we introduce a new federated framework, FED-CD, that enables the discovery of global causal structures both when the set of intervened covariates is the same across decentralized entities, and when the set of intervened covariates are potentially disjoint. We perform a comprehensive experimental evaluation on synthetic data that demonstrates that FED-CD enables effective aggregation of decentralized data for causal discovery without direct sample sharing, even when the contributing distributed data sets cover disjoint sets of interventions. Effective methods for causal discovery in distributed data sets could significantly advance scientific discovery and knowledge sharing in important settings, for instance, healthcare, in which sharing of data across local sites is difficult or prohibited.
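
The idea of "exchanging updates instead of samples" can be illustrated with a generic federated averaging loop. This is only a sketch of the communication pattern under simplifying assumptions: it fits a single regression weight rather than a causal graph, and it is not the FED-CD procedure. The function names and the toy client data are hypothetical.

```python
def local_gradient(w, data):
    """Gradient of the mean of 0.5 * (w * x - y)**2 over one client's samples."""
    return sum((w * x - y) * x for x, y in data) / len(data)


def federated_fit(clients, rounds=200, lr=0.1):
    """Fit y = w * x across clients; only gradients and sample counts are shared."""
    w = 0.0
    total = sum(len(data) for data in clients)
    for _ in range(rounds):
        # Each client computes its update locally; raw samples never leave it.
        updates = [(local_gradient(w, data), len(data)) for data in clients]
        # The server aggregates updates, weighted by local sample count.
        grad = sum(g * n for g, n in updates) / total
        w -= lr * grad
    return w


# Two hypothetical sites whose private data both follow y = 2x.
clients = [[(1.0, 2.0), (2.0, 4.0)], [(3.0, 6.0)]]
w = federated_fit(clients)
```

Weighting each update by the local sample count makes the aggregated gradient equal to the gradient on the pooled data, so in this toy case the server reaches the same estimate it would have obtained with centralized access.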

FoVolNet: Fast Volume Rendering using Foveated Deep Neural Networks

David Bauer, Qi Wu, Kwan-Liu Ma

Category: Machine Learning

2022-09-20

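Volume data is found in many important scientific and engineering applications. Rendering this data for visualization at high quality and interactive rates for demanding applications such as virtual reality is still not easily achievable, even with professional-grade hardware. We introduce FoVolNet, a method that significantly increases the performance of volume data visualization. We develop a cost-effective foveated rendering pipeline that sparsely samples a volume around a focal point and reconstructs the full frame using a deep neural network. Foveated rendering is a technique that prioritizes rendering computations around the user's focal point. This approach leverages properties of the human visual system, thereby saving computational resources when rendering data in the periphery of the user's field of vision. Our reconstruction network combines direct and kernel prediction methods to produce fast, stable, and perceptually convincing output. With a slim design and the use of quantization, our method outperforms state-of-the-art neural reconstruction techniques in both end-to-end frame times and visual quality. We conduct extensive evaluations of the system's rendering performance, inference speed, and perceptual properties, and we provide comparisons to competing neural image reconstruction techniques. Our test results show that FoVolNet consistently achieves significant time savings over conventional rendering while preserving perceptual quality.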
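
A rough sketch of sparse sampling around a focal point: each pixel is rendered with a probability that decays with its distance from the gaze position, and the unsampled pixels would later be filled in by a reconstruction network. The Gaussian falloff, the `floor` probability for the periphery, and the function name are assumptions made for illustration; the sampling pattern actually used by FoVolNet may differ.

```python
import math
import random


def foveated_mask(width, height, focus, sigma, floor=0.02, seed=0):
    """Boolean mask of pixels to render; sampling is dense near `focus`."""
    rng = random.Random(seed)
    fx, fy = focus
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            d2 = (x - fx) ** 2 + (y - fy) ** 2
            # Sampling probability: 1 at the focal point, decaying to `floor`.
            p = max(floor, math.exp(-d2 / (2.0 * sigma ** 2)))
            row.append(rng.random() < p)
        mask.append(row)
    return mask


mask = foveated_mask(64, 64, focus=(32, 32), sigma=8.0)
sampled_fraction = sum(sum(row) for row in mask) / (64 * 64)
```

Only the pixels marked `True` would be passed to the volume renderer, so the rendering cost scales with `sampled_fraction` rather than with the full frame size.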

Predicting the protein-ligand affinity from molecular dynamics trajectories

Yaosen Min, Ye Wei, Peizhuo Wang, Nian Wu, Stefan Bauer, Shuxin Zheng, Yu Shi, Yingheng Wang, Dan Zhao, Ji Wu

Category: Machine Learning

2022-08-19

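Accurate protein-ligand binding affinity prediction is essential in drug design and many other molecular recognition problems. Despite many advances in affinity prediction based on machine learning techniques, these methods remain limited because protein-ligand binding is determined by the dynamics of atoms and molecules. To this end, we curated an MD dataset containing 3,218 dynamic protein-ligand complexes and further developed Dynaformer, a graph-based deep learning framework. Dynaformer can fully capture the dynamic binding rules by considering various geometric characteristics of the interaction. Our method outperforms the methods reported to date. Moreover, we performed virtual screening on heat shock protein 90 (HSP90) by integrating our model with structure-based docking. We benchmarked against other baselines, showing that our method can identify the molecule with the highest experimental potency. We anticipate that large-scale MD datasets and machine learning models will form a new synergy, providing a new route toward accelerated drug discovery and optimization.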

Development and Validation of ML-DQA -- a Machine Learning Data Quality Assurance Framework for Healthcare

Mark Sendak, Gaurav Sirdeshmukh, Timothy Ochoa, Hayley Premo, Linda Tang, Kira Niederhoffer, Sarah Reed, Kaivalya Deshpande, Emily Sterrett, Melissa Bauer

Categories: Machine Learning (Statistics) | Machine Learning

2022-08-04

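The approaches by which the machine learning and clinical research communities use real-world data (RWD), including data captured in the electronic health record (EHR), vary dramatically. While clinical researchers use RWD cautiously for clinical investigations, ML for healthcare teams consume public datasets with minimal scrutiny to develop new algorithms. This study bridges this gap by developing and validating ML-DQA, a data quality assurance framework grounded in RWD best practices. The ML-DQA framework is applied to five ML projects across two geographies, different medical conditions, and different cohorts. Across the five projects, RWD was gathered on 247,536 patients, and a total of 2,999 quality checks and 24 quality reports were generated. Five generalizable practices emerge: all projects used a similar method to group redundant data element representations; all projects used automated utilities to build diagnosis and medication data elements; all projects used a common library of rule-based transformations; all projects used a unified approach to assign data quality checks to data elements; and all projects used a similar approach to clinical adjudication. An average of 5.8 individuals, including clinicians, data scientists, and trainees, were involved in implementing ML-DQA for each project, and an average of 23.4 data elements per project were either transformed or removed in response to ML-DQA. This study demonstrates the important role of ML-DQA in healthcare projects and provides teams with a framework to conduct these essential activities.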

Instant Neural Representation for Interactive Volume Rendering

Qi Wu, Michael J. Doyle, David Bauer, Kwan-Liu Ma

Category: Machine Learning

2022-07-23

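Neural networks have shown great potential in compressing volumetric data for scientific visualization. However, due to the high cost of training and inference, such volumetric neural representations have so far only been applied to offline data processing and non-interactive rendering. In this paper, we demonstrate that by simultaneously leveraging modern GPU tensor cores, a native CUDA neural network framework, and online training, we can achieve high-performance, high-efficiency interactive ray tracing using volumetric neural representations. Additionally, our method is fully generalizable and can adapt to time-varying datasets. We present three strategies for online training, each leveraging a different combination of the GPU, the CPU, and out-of-core streaming techniques. We also develop three rendering implementations that allow interactive ray tracing to be coupled with real-time volume decoding, sample streaming, and in-shader neural network inference. We demonstrate that our volumetric neural representations can scale up to terascale for regular-grid volume visualization and can easily support irregular data structures such as OpenVDB, unstructured, AMR, and particle volume data.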

Latent Variable Models for Bayesian Causal Discovery

Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

Categories: Machine Learning | Artificial Intelligence | Machine Learning (Statistics)

2022-07-12

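Learning predictors that do not rely on spurious correlations involves building causal representations. However, learning such representations is very challenging. We therefore formulate the problem of learning a causal representation from high-dimensional data and study causal recovery with synthetic data. This work introduces a latent variable decoder model, Decoder BCD, for Bayesian causal discovery and performs experiments in mildly supervised and unsupervised settings. We present a series of synthetic experiments to characterize important factors for causal discovery and show that using known intervention targets as labels helps in unsupervised Bayesian inference over the structure and parameters of linear Gaussian additive noise latent structural causal models.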
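
For orientation, a linear Gaussian additive noise structural causal model and a hard intervention on one of its variables can be sketched as follows. The three-node chain, the edge weights, and the function name are hypothetical; the sketch covers only the latent data-generating process and not the decoder or the Bayesian inference described in the abstract.

```python
import random

# Hypothetical chain z0 -> z1 -> z2; WEIGHTS[j][i] is the weight of edge i -> j.
WEIGHTS = [
    [0.0, 0.0, 0.0],
    [2.0, 0.0, 0.0],
    [0.0, -1.5, 0.0],
]


def sample(rng, intervention=None, noise_std=1.0):
    """Ancestral sampling; `intervention` maps a node index to a fixed value."""
    intervention = intervention or {}
    z = [0.0] * len(WEIGHTS)
    for j in range(len(WEIGHTS)):        # nodes are listed in topological order
        if j in intervention:
            z[j] = intervention[j]       # hard intervention: ignore the parents
        else:
            parents = sum(WEIGHTS[j][i] * z[i] for i in range(j))
            z[j] = parents + rng.gauss(0.0, noise_std)   # additive Gaussian noise
    return z


rng = random.Random(0)
interventional = [sample(rng, intervention={1: 3.0}) for _ in range(5000)]
mean_z2 = sum(z[2] for z in interventional) / len(interventional)
```

Under the intervention that fixes `z1` to 3.0, the mean of `z2` should be close to -1.5 * 3.0 = -4.5 regardless of `z0`. A known intervention target tells the learner which mechanism was replaced, which is the kind of label the abstract refers to.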
